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Abstract 

Learning and adaptation is a fundamental prop- 
erty of intelligent agents. In the context of adap- 
tive information filtering, a filtering agent's beliefs 
about a user's information needs have to be re- 
vised regularly with reference to the user's most 
current information preferences. This learning 
and adaptation process is essential for maintain- 
ing the agent's filtering performance. The ACM 
belief revision paradigm provides a rigorous foun- 
dation for modelling rational and minimal changes 
to an agent's beliefs. In particular, the maxi- 



adjustment method, which follows the ACM ra- 
tionale of belief change, offers a Sound and ro- 
bust computational mechanism to develop adap- 
tive agents so that learning autonomy of these 
agents can be enhanced. This paper describes how 
the maxi-adjustment method is applied to develop 
the learning components of adaptive information 
filtering agents, and discusses possible difficulties 
of applying such a framework to these agents. 



Introduction 

With the explosive growth of the Internet and the 
World Wide Web ( Web), it is becoming increasingly dif- 
ficult for users to retrieve relevant information. This is 
the so-called problem of information overload on the In- 
ternet. Augmenting existing Internet search tools with 
personalised information filtering agents is one possible 
method to alleviate this problem. Adaptive information 
filtering agents are computer systems situated on the 
Web. They autonomously filter the incoming stream of 
information on behalf of the users. Users' information 
needs will change over time, and information filtering 
agents must be able to revise their beliefs about the 
users' information needs so that the accuracy of the fil- 
tering process can be maintained . The AGM belief 



revision p aradigm (Alchourron, Gardcnfors, & Makin- 
jon 1985| ) provides a rich and rigorous foundation for 
modelling such revision processes. It enables an agent 
to modify its beliefs in a rational and minimal w ay. 
Maxi-adjustment ( [Williams 199e| |Williams 1997eJ ) is 
a sp ecific change strategy that follows the AGM's ratio- 
nale of belief revision. In particular, it transmutes the 



underlying entrenchment ranking of beliefs in an abso- 
lute minimal way under maxim al information inertia . 
In informati on retrieval models ( Salton fe McGill 1983) ; 
Salton 1989), information objects are often assumed in- 
dependent unless semantic relationships among them 
can be derived. This intuition coincides with the un- 
derlying assumption of the maxi-adjustment strategy. 
The advantage of employing the maxi-adjustment strat- 
egy as the agents' learning mechanism is that seman- 
tic relationships among information items can be taken 
into account during the agents' learning and adapta- 
tion processes. L ess users' relevance feedback (Salton 



fc Buckley 1990 ) is required to train the filtering 



agents, and hence a higher level of learning auton- 
omy can be achieved when compared with the other 
learnin g approaches employed in adaptive information 
agents (IBillsus fc Pazzani 1999|; |Moukas fc Maes 1998] ; 



Balabanovic 1997; Pazzani, Muramatsu, fc Billsus 199(: ; 



Armstrong et al. 1995| ). This mper focuses on the ap 
plication of the maxi-adjustment method to the devel- 
opment of learning mechanisms in adaptive information 
filtering agents. Moreover, difficulties of applying such 
a framework to the filtering agents are discussed. 

The Adaptive Filtering Agent 

Figure E] is an overview of the major functional compo- 
nents of an adaptive information filtering agent. The 
focus of this paper is on the learning component of 
the adaptive information filtering agent. The filtering 
agent's memory holds the current representation of a 
user's infor mation needs. In particular, the notion of 



belief state ( Gardenfors 1988 ) is used to represent these 
information needs. The learning component accepts 
a user's relevance feedback about filtered Web docu- 
ments. Based on the proposed induction method, these 
feedback is converted to the beliefs of a user's infor- 
mation needs. The maxi-adjustment strategy is then 
applied to revise the beliefs stored in the agent's mem- 
ory with respect to these newly induced beliefs. As the 
user's information needs will change over time, such a 
belief revision process needs to be conducted repeat- 
edly. Technically, the learning behavior demons trated 



by the ag ent is a kind of reinforcement learning (Lang- 



ley 1996). The matching component filters out relevant 



information from a stream of incoming Web documents. 
It is underpinned by logical deduction. In other words, 
if the representation of a Web document is logically en- 
tailed by the set of beliefs stored in the agent's memory, 
the Web document will be considered as relevant and 
presented to the user. According to the user's relevance 
feedback, new beliefs may be added to, or existing be- 
liefs may be contracted from the agent's memory. The 
adaptive information filtering agent is one of the main 
elements in the agent-based information fi ltering sys- 
tem (AIFS) QLau, Hofstede, fe Bruza 1999|). 
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Figure 1: An Overview of Adaptive Information Filter- 
ing Agent 



The AGM Belief Revision Paradigm 

The AGM 



Ac 



par adigm (Alchourron, Gardenfors, 
inson 1985|) provides a rigorous foundation for 



modelling consistent and minimal changes to an agent's 
beliefs. In particular, belief revision is taken as tran- 
sitions among belief states ( |Gardcnfors 1988] ). A be- 
lief state can be represented by a belief s et K which is 



a t heory in a p ropositional language C (Gardenfors & 
Mai inson 1988). For the discussion in this paper, it is 



In the AGM framework flAlchourron, Gardenfors, & 


Mai inson 19851 


Gardenfors 198S; Gardenfors 1992; 


Gardenfors & Ma 


dnson 1988|), three principle types of 



Deuel state transitions are identified and modelled by 
corresponding belief functions: expansion (K£), con- 
traction (K~), and revision (AT*). The process of be- 
lief revision can be derived from the process of belief 
contraction and vice versa through the so-called Levi 
Identity i.e. K* = (AT~ Q )+, and the Harper Iden- 
tity i.e. K~ = K R A* Q . Essentially, the AGM 
framework proposes sets of postulates to characterise 
these functions such that they adhere to the ratio- 
nales of consistent and minimal changes. In addi- 
tion, it also describes the constructions of these func- 
tions based on various mechanisms. One of them 



1988) 



is e pistem ic entrenchment (Gardenfors & Makin 
For instance 





son 

lief set K, a 
trenched as a 



if a, are beliefs in a be- 
$^ means that is at least as en- 
If inconsistency arises after applying 



changes to a belief set, beliefs with the lowest degree 
of epistemic entrenchme nt arc given up. Technically , 
epistemic entrenc hment ( pardenfors fc Makinson 1988] ; 
Gardenfors 1992| ) is a total preorder of the sentences 
(e.g. a, 0, 7) in £, and is characterised by the follow- 
ing postulates: (EE1): If a ^ and ^ 7, then 
a ^ 7; (EE2): If a h 0, then a < 0; (EE3): For 
any a and 0, a ^ a A or ^ a A 0; (EE4): When 
K ^ K±, a £ K iff a «S 0, for all 0; (EE5): If < a 
for all 0, then h a. 

It has been proved that an unique contraction func- 
tion can be defined by the underlying epistemic en- 
trenchment thr ough the (C-) condition ( Gardenfors fc 



Makinson 1988| ): 



(C-) 



A"; 



{0 E K : a < a V 0} if \f a 
K otherwise 



where < is the strict part of epistemic entrench- 
ment _di?fiiieii_alx)ve^ Moreover, the (C-R) condi- 



tion ( Gardenfors 1992 ) also ensures that if an ordering 
of beliefs satisfies (EE1)-(EE5), the contraction func- 
tion, uniquely determined by (C-R), satisfies all but 
the recovery postulates for contraction. 

{0 G K : a < 0} if \f a 
K otherwise 



(C-R) 



k: 



Nevertheless, for a computer based implementation, 
a finite representation of epistemic entrenchment or- 
dering and a poli cy of iterated belief changes ar e re- 
quired. Williams (Williams 1995; Williams 1997b) pro- 
posed the finite partial entrenchment ranking (B) that 
ranked the sentences of a theory in C with the mini- 
mum possible deg ree of entrenchment (^b)- More over, 



maxi- adjustment (Williams 1996; Williams 1997a) was 
proposed to transmute a finite partial ranking using 
an absolute measure of minimal change under max- 
imal information inertia. Belief revision is not just 
taken as adding or deleting a sentence from a the- 
ory but the transmutati on of the underlying entrench- 
ment ranking. W illiams ( Williams 1995j ; Williams 1996j ; 
Williams 1997a ) formally defined the following defini- 
tions for a computational model of belief revision. 

Definition 1 A finite partial entrenchment ranking is 
a function B that maps a finite subset of sentences in C 
into the interval [0, O] such that the following conditions 
are satisfied for all a G rfom(B): 

(PERI) {0 G rfom(B) : B(a) < B(0)} \f a. 

(PER2) If\-->a then B(a) = 0. 

(PER3) B(a) = O if and only if ha. 

The set of all partial entrenchment rankings is de- 
noted B. B(a) is referred as the degree of acceptance 
of a. The explicit information content of B G B is 
{a G dom(B) : B(a) > 0}, and is denoted exp(B). 
Similarly, the implicit information content represented 
by B G B is Cn(exp(B)), and is denoted content(B). 
Cn is the classical consequence operator. In order to 
describe the epistemic entrenchment ordering (^b) gen- 
erated from a finite partial entrenchment ranking B, it 
is necessary to rank implicit sentences. 



2 



De finition 2 Let a be a non tautological sentence. Let 



B b ; a finite partial entrenchment ranking. The degree 



of acceptance of a is defined as: 



degree(B, a) — 



largest j such that 

{13 e exp{B) : B(/3) >j}ha 

if a £ content(B) 
otherwise 



The maxi- adjustment strategy transmutes a partial 
entrenchment ranking B based on the rationale of ab- 
solute minimal change under maximal information in- 
ertia. It is assumed that sentences in exp(B) (e.g. a,/?) 
are independent unless logical dependence exists be- 
tween them. In particular, a is defined as a reason 
of (3 if and only if degree(B, a — > (3) > B(/3). 

Definition 3 Let B 6 B be finite. The range of B 
is enumerated in ascending order as jo, ji, j2, jo- 
Let a be a contingent sentence, j m — degree(B, a) and 
< i < O. Then the (a,i) maxi- adjustment of B is 
B*(a,i) defined by: 



B*(a,i) = 



_ / (B-(cM)) 



if i < jn 



(B (-ia, 0)) + (a, i) otherwise 



where for all R £ dnm (Tj\ R (n- ?.) is defined as fnl- 



lowt 

1. For 13 with B(/3) > j m , B~(a,i)(/3) = B(/3). 

For /3 wif/i i < B(/3) < j m; assuming that 
B~(a,i)(/3) /or /3 is defined with B(/3) > j m -fc /or 



-1,0,1,2, 



B-(a,*)09) 



. , n — 1, tften /or /3 tuit/i B(/3) = j m -n, 

i if a \- [3 or 
a\f (3 and 13 £ T 
where V is a minimal subset of 
{7 : B(7) = j m -n} such that 
{7 : B-(a,i)(7) >Jm-„}urha 
B(/3) otherwise 

3. For (3 with B(/3) < i , B"(a, i)(/3) = B(/3). 
For aZZ /? S dom(B) U {a}, B + (a,i) is defined as 
follows: 

B(/3) z/B(/3)> i 
i if a = (3 or 

B(/3) < i < de#ree(B, a -> j9) 
degree (B, a — > /3) otherwise 

It has been stated that if i > then content (B* (a, i)) 
is an AGM revision, and conient(B*(a, 0)) satisfies 
all b ut the recove ry postulates for AGM contrac- 
tion dWilliams 1996|). 



B+(cm)(/3) 



Knowledge Representation 

A Web page is characterised by a set of weighted key- 
words base d on traditional information retrieval (IR) 
techniques ( |Salton fc McGill 198 |Salton 1989| ). At 
the symbolic level, each keyword k is mapped to the 
ground term of the positive keyword predicate pkw i.e. 
pkw(k). Basically, pkw(k) is a proposition since its in- 
terpretation is either true or false. The intended in- 
terpretation of these sentences is that they are satis- 
fled in a document D i.e. D \= pkw{k) if D is taken 



as a model ( phiaramella fc Chevallet 1992 ; Lalmas 
1998). For example, if d = { business, commerce, trade, 



} is the document representation at the keyword 
level, the corresponding representation at the symbolic 
level will be d = { pkw(business), pkw (commerce), 
pkw(trade), . . .}. Similarly, a user's information needs 
are also represented as a set of weighted keywords 
at the keyword level. However, this set of weighted 
keywords is derived from a set of relevant documents 
D + and a set of non-relevant documen ts D_ with 
respect to the user's informa tion needs ( Allan 1996| ; 
Buckley, Salton, & Allan 1994). Based on the frequen- 
cies of these keywords appearing in D + and D~ , it is 
possible to induce a preference ordering among the key- 
words with respect to the user's information needs. The 
basic idea is that a keyword appearing more frequently 
in D + is a more preferred keyword than another key- 
word that appears less frequently in D + . Once this pref- 
erence ordering is induced, it is taken as the epistemic 
entrenchment ordering of the corresponding beliefs. It 
is observed that the postulates of epistemic entrench- 
ment is valid in the context of informa tion retrieval in 
general (Lau, Hofstede, & Bruza 199E). For example, 



if \a,[3,~f\ is a set of information carriers (Bruza & 
Huibers 1994), a < (3 and (3 < 7 implies a < 7. In 



other words, if an information searcher prefers informa- 
tion carrier 7 rather than information carrier (3, and 
information carrier (3 rather than information carrier 
a, they prefer retrieving 7 rather than a. This char- 
acteristic of information carriers matches the epistemic 
entrenchment postulate e.g. (EE1) of beliefs. 

Moreover, it is necessary t o classify a keywo rd as 
positive, neutral, or negative (Kindo et al. 1997). In- 
tuitively, positive keywords represent the information 
items in which the users interested. Negative keywords 
represent the information items that the users do not 
want to retrieve. Neutral keywords mean that these 
keywords are not useful for determining the users' in- 
terests . Eq.( | l[) is develop ed based on the keyword clas- 
sifier ( Kindo et al. 1997 ). It can be used to induce the 
preference value pre(k) of a keyword k, and classify it 
as positive, negative, or neutral. 



df(k) 



pre(k) = e x tanh 

(p{k rel ) tanh £g=ii - (1 - p(k rel )) tanh ) 

(1) 

where e is used to restrict the range of pre(k) such 
that —1 < pre(k) < 1. The examples illustrated in this 
paper assume that e = 0.9. df(k) is the sum of the 
number of relevant documents df(k re l) and the number 
of non-relevant documents df(k nre i) that contains the 
keyword k, and tanh is the hyperbolic tangent. The 
rarity parameter £ is used to control rare or new key- 
words and is expressed as mt(logN +1), where N is 
the total number of Web documents judged by a user, 
and int is an integer function that truncates the decimal 
values. p(k re i) is the estimated probability that a docu- 



ment containing keyword k is relevant and is expressed 
as the fraction d ^ k ^yp§^ — jy- Prei is the estimated 
probability that a document is relevant. In our system, 
it is assumed that the probability that a Web document 
presented by the filtering agent and judged as relevant 
by a user is p re i — 0.5. A positive value olpre(k) implies 
that the associated keyword is positive, whereas a neg- 
ative value of pre(k) indicates a negative keyword. If 
pre(k) is below a threshold value A, the associated key- 
word is considered neutral. It is assumed that A = 0.5 
for the examples demonstrated in this paper. Basically 
a positive keyword k is mapped to pkw(k), and a neg- 
ative keyword k is mapped to ^pkw(k). There is no 
need to create the symbolic representations for neutral 
keywords. For pkw(k) or ->pkw(k), the entrenchment 
rank B(afc) of the corresponding formula is defined 
as: 



B(o fc ) = 



\pre{k)\ 




if |pre(fe)| > A (2] 
otherwise ^ ' 



The following is an example of computing the en- 
trenchment rank B(afe) from a set of judged Web doc- 
uments. It is assumed that there are a set of five doc- 
uments having been judged as relevant (i.e. \D + \ = 5) 
and another set of five documents having been judged 
as non-relevant by a user (i.e. \D~\ = 5). Each doc- 
ument is characterised by a set of keywords e.g. d\ = 
{business, commerce, system}, d2 = {art, sculpture}. 
Table | summarises the frequencies of these keywords 
appearing in both D + and D~ , their preference values, 
and the entrenchment ranks of corresponding formulae. 



Table 1: Representation of users' information preferences 



Keywords 


D + 


D~ 


pre(k) 


Formula: ct £ 


Rank:B(a fc ) 


business 


5 





0.856 


pkw (business) 


0.856 


commerce 


4 





0.836 


pkw (commerce) 


0.836 


system 


2 


2 









art 





5 


-0.856 


—ipkw(art) 


0.856 


sculpture 





3 


-0.785 


-^pkw (sculpture) 


0.785 


insurance 


1 





0.401 







Learning and Adaptation 

Whenever a user provides relevance feedback for a 
presented Web document, the belief revision process 
can be invoked to learn the user's current information 
preferences. Conceptually, the filtering agent's learn- 
ing and adaptation mechanism is characterised by the 
belief revision and contraction processes. For exam- 
ple, if r = {ai, d2, • ■ ■ , a n } is a set of formulae repre- 
senting a Web document d G D + , the belief revision 
process (((-SQ^ )„„)•• is invoked for each a % G T, 
where K is the belief set stored in the filtering agent's 
memory On the other hand, the belief contraction 
process (((K~ ± )~ 2 ) . . .)~ is applied for each a, G T 



and d G D~ . The sequence of revising or contract- 
ing the set of beliefs a\,a2, ■ ■ ■ ,a n is determined by 
their entrenchment ranks and whether it is a revi- 
sion or contraction operation. At the computational 
level, belief revision is actually taken as the adjust- 
ment of entrenchment ranking B in t he theory base 
exp(B). Partic ularly, maxi- adjustment ( Williams 1996] ; 



Williams 199S| ) is employed by the learning component 
of the filtering agent to modify the ranking of its beliefs 
in an absolute minimal way under maximal informa- 
tion inertia. As the input to the maxi-adjustment al- 
gorithm consists of a sentence a and its entrenchment 
rank i, the procedure described in the Knowledge Rep- 
resentation Section is used to induce the new rank for 
each a G T. Moreover, for our implementation of the 
maxi-adjustment algorithm, the maximal ordinal O in 
the interval [0, O] is chosen as 1. 

One advantage of a symbolic representation of 
the filtering agent's domain knowledge is that 
semantic relationships among keywords 
captured. For example, if the keywords 
and commerce are taken as synonymous, 
mantic relationship can be modelled as a 
pkw(business) = pkw (commerce) in the 
agent's memory. Moreover, classification knowledge 
such as sculpture is a kind of art can also be used by 
specifying the rule such as pkw (sculpture) — > pkw(art). 
It is believed that capturing the semantic rela- 
tionships among keywords can impr ove the effec- 

1995|; 



can be 
business 
this se- 
formula 
filtering 



tiveness of the matching p rocess ( Eunter 



Nie, Brisebois, fc Lepage 1995| ). In fact, by employing 



maxi-adjustment as the filtering agent's learning mech- 
anism, these semantic relationships can be reasoned 
about during the reinforcement learning process. This 
could lead to a higher level of learning autonomy since 
changes to related keywords can automatically be 
inferred by the filtering agent. As a result, less users' 
relevance feedback may be required. The following 
examples assume that the formulae pkw (business) 
= pkw (commerce) and pkw (sculpture) — > pkw (art) 
have been manually added to the the agent's memory 
through a knowledge engineering process. 

Example 1: 

The first example shows how adding one belief to the 
agent's memory will automatically raise the entrench- 
ment rank of another related belief. It is assumed that 
the belief H(^pkw(sculpture)) = 0.785 and the belief 
H (pkw (business)) = 0.856 have been learnt by the fil- 
tering agent. If several Web documents characterised 
by the keyword art are also judged as non-relevant by 
the user later on, the preference value of the keyword 
art can be induced according to Eq. ( []) . Assuming that 
pre(art) = —0.856, the corresponding entrenchment 
rank can be computed as H(^pkw(art)) = 0.856 ac- 
cording to Eq.(||). By applying H*(->pkw(art), 0.856) 
to the theory base exp(H), the before and after im- 
ages of the agent's explicit beliefs (i.e. exp(H)) can be 
tabulated in Table 0. Based on the maxi-adjustment al- 



4 



gorithm, B + (a,i)(/3) 
0). 



i if B(/3) < i < degree(B, a 



V Ti(-ipkw(sculpture)) < 0.856 < 

de<?ree(B, -ipkw(art) — > -^pkw (sculpture)) 
.'. B + (^pfcu>(art), 0.856)(-ipfcw(scMZpture)) = 0.856 

The implicit belief -^pkw(art) — > -^pkw (sculpture) 
in conient(B) is derived from the explicit belief 
pkw (sculpture) — > pkw(art) in the theory base exp(B), 
and its degree of acceptance is 1 according to Defi- 
nition ||. As the belief ^pkw(art) implies the belief 
-^pkw (sculpture) and the agent believes in ^pkw(art), 
the belief -^pkw (sculpture) should be at least as en- 
trenched as the belief ~^pkw(art) according to (PERI) 
of Definition |l| or (EE2). In other words, whenever 
the agent believes that the user is not interested in art 
(i.e. ~^pkw(art)), it must be prepared to accept that the 
user is also not interested in sculpture at least to the 
degree of the former. The proposed learning and adap- 
tation framework is more effective than other learning 
approaches that can not take into account the semantic 
relationships among information items. This example 
demonstrates the automatic revision of the agent's be- 
liefs about related keywords given the relevance feed- 
back for a particular keyword. Therefore, less users' 
relevance feedback may be required during reinforce- 
ment learning. Consequently, learning autonomy of the 
filtering agent can be enhanced. 



Table 2: Raising related beliefs 



Formula: a 


B(a) Before 


B(q) After 


pkw(business) = pkw (commerce) 


1.000 


1.000 


pkw(sculpture) — » pkw(art) 


1.000 


1.000 


pkw (business ) 


0.856 


0.856 


-^pkw (sculpture) 


0.785 


0.856 


-^pkw(art) 





0.856 



Example 2: 

The second example illustrates the belief contrac- 
tion process. In particular, how the contraction of one 
belief will automatically remove another related belief 
from the agent's memory if there is a semantic rela- 
tionship between the underlying keywords. Assuming 
that more Web documents characterised by the keyword 
sculpture are judged as relevant by the user at a later 
stage, the belief B(pkw(sculpture)) — 0.785 could be 
induced. As B*(a, i) = (B~ (-ia, 0)) + (a, i) if i > j m , 
where i = 0.785 and j m = in this example, B*( 
pkw (sculpture) , 0.785) leads to the contraction of the 
belief ^pkw (sculpture) from the theory base exp(B). 
Moreover, B~( -i pkw (sculpture), 0) (-> pkw(art)) = 
is computed because B( -i pkw( art)) = 0.856 = j m - n 
and the set {7 : B~( -1 pkw(sculpture), 0) (7) > 0.856 } 

U { -1 pkw(art) } I 1 pkw (sculpture) is obtained. The 

before and after images of the filtering agent's explicit 
beliefs are tabulated in Table |3|. 

Example 3: 



Table 3: Contracting related beliefs 



Formula: ct 


B(a) Before 


B(oj) After 


pkw(business) = pkuj(coramerce) 


1.000 


1.000 


pkw(sculpture) — > pkw(art) 


1.000 


1.000 


pkw (business) 


0.856 


0.856 


pkw (sculpture) 





0.785 


-^pkw (sculpture) 


0.856 


O 


-^pkw(art) 


0.856 


O 



This example demonstrates how multiple sentences 
from the same Web document judged by a user can 
be contracted from the agent's memory. If some 
Web documents characterised by sculpture and busi- 
ness have recently been judged as non-relevant by 
the user, both the belief H(pkw(sculpture)) = and 
the belief B (pkw -(business)) — could be induced. 
Since beliefs with minimal entrenchment rank (i.e. 0) 
are not supposed to be stored in the agent's mem- 
ory, maxi-adjustement for these two beliefs is still re- 
quired so that they can be removed from the agent's 
memory. Basically, the contraction process B~(a, i) 
is applied to both sentences. As opposed to the 
belief expansion process where the most entrenched 
sentence is applied to the agent's memory first, the 
least entrenched belief is first contracted from the 
agent's memory for belief contraction. Consequently, 
B~ (pkw '(sculpture) ,0) is invoked first. As there is 
not other logically related sentences in the theory base 
exp(B), the belief pkw (sculpture) is simply removed 
from the agent's memory. Similarly, the sentence 
pkw(business) is also removed from exp(B) by apply- 
ing B~ (pkw(business) , 0). The before and after images 
of the filtering agent's explicit beliefs are tabulated in 
Table |. 

Table 4: Contracting multiple beliefs in one cycle 



Formula: a 


B(oi) Before 


B(a) After 


pkw(business) = pkw(commerce) 


1.000 


1.000 


pkw(sculpture) — * pkw(art) 


1.000 


1.000 


pkw (business ) 


0.856 


O 


pkw (sculpture) 


0.785 






Filtering Web Documents 

In our current framework, the matching function of the 
filtering agent is modelled as logical deduction. More- 
over, a Web docume nt is taken as the conjunction 
of a set of f ormulae ( Chiaramella fc Chcvallct 1992] ; 
Hunter 1995| ). The following example illustrates the 



agent's deduction process with reference to previous ex- 
amples. The resulting belief sets K — Cn(exp(R)) from 
examples 1, 2, and 3 in the previous section are used 
to determine the relevance of the following three Web 
documents: 



5 



4> = {pkw (business) A pkw(art)} 

ip = {pkw (sculpture) A pkw(art)} 

ip = {pkw (business) A pkw (commerce)} 

The filtering agent's conclusions about the relevance 
of the Web documents are summarised as follows: 

Time: (tl) 

K = Cn({pkw(business) = pkw(commerce) , 
pkw (sculpture) — > pkw(art), 
pkw (business), -<pkw (sculpture), -^pkw(art)}) 
K\f<p, K\/Lp, Khifj 

Time: (t2) 

K = Cn({pkw(business) = pkw (commerce), 
pkw (sculpture) — ► pkw(art), 
pkw(business) , pkw (sculpture)}) 
K\-<j>, K h (p, Khip 

Time: (t3) 

K = Cn({pkw(business) = pkw (commerce), 
pkw (sculpture) — ► pkw (art)}) 
K\/<P, K\f<p, K\/i> 

As can be seen tentative conclusion drawn at time tl 
may not hold when new information e.g. ^ to((m!iH 
is processed by the agent at time t3. Strictly speak- 
ing, the deduction process of the agent should be de- 
scribed as K ip, where ^ is a nonmonotonic in- 
ference relation because the inferred beliefs (i.e. con- 
clusions) will not grow monotonically. It is not dif- 
ficult to see that this should belong to the class 
of non monotonic inference called exp ectation inference 
(H<) (Gardenfors & Makinson 1994). The basic idea 
of expectation inference is that given a sentence a of 
a propositional language C, if a and the subset of sen- 
tences in a belief set K that is consistent with a (i.e. 
{a} U {(3 £ K : -^a < j3}) can classically entail another 
sentence (3, a (-^ (3 can be deduced. With reference to 
our examples, since K classically entails or not entails 
<p, if and ip, and K U K = K , the definition of expec- 
tation inference can trivially be applied to describe the 
characteristics of the inference mechanism in the filter- 
ing agent. Therefore, K |^ <p, where K is the filtering 
agent's belief set and <p is the logical representation of 
a Web document, represents an inference conducted by 
the adaptive filtering agent. 

Discussion 

An alternative approach for developing the filtering 
agent's learning mechanism is to apply the (C-) or the 
(C-R) condition to revise the agent's beliefs. It has 
been stated that both the (C-) and the (C-R) condi- 
tions can be used to construct the same class of belief 
revision functio ns that satisfy the AGM postulates for 
belief revision ( pardenfors 1992 ). The following ex- 
ample illustrates how belief revision may be conducted 



based on the (C-R) condition. Assuming that the fil- 
tering agent's initial belief set K is as follows: 

K = {pkw(business),pkw(commerce), 
pkw (sculpture), pkw (art)} 

Moreover, 

pkw(business) ^ pkw(commerce) ^ 
pkw (sculpture) ^ pkw(art) 

If a user perceives some Web documents characterised 
by the keyword business as non-relevant, the revision 



process Kl 



<pkw {business) 



will be invoked. Based on the 



Levi Identity, the sentence pkw(business) should first 
be contracted from the belief set K . Consequently, 
all the beliefs from K will be contracted according to 
the (C-R) condition. The resulting belief set becomes 
K = {->pkw(business)} . Nevertheless, a user who 
does not require information objects about business 
may still be interested in information objects about 
art and sculpture. Therefore, applying (C-R) or (C-) 
to construct belief contraction or belief revision func- 
tion seems producing drastic changes in the context of 
information retrieval and filtering. In fact, term in- 
dependence is often assumed in information retrieval. 
This intuition is reflect ed in the vector space model 
of information retrieval (Salton & McGill 1983), where 
changing the weight of a particular keyword may not 
affect the others in the weight vector. Therefore, the 
maxi-adjustment strategy produces a better approxi- 
mation in terms of revising or contracting beliefs about 
information objects with respect to a user's information 
needs. 

Under the current framework, domain knowledge 
such as semantic relationships among information ob- 
jects is transferred to the filtering agent's memory 
through a knowledge engineering process. Since not 
all semantic relationship is highly certain (i.e. assigning 
the maximal entrenchment rank) , by applying belief re- 
vision to the agent's memory, the corresponding beliefs 
may be contracted from the memory over time. There- 
fore, domain knowledge perhaps needs to be transferred 
to the agent's memory periodically in accordance with 
the postulates of epistemic entrenchment. This can be 
taken as an off-line process to minimise its impact on 
the availability of the filtering agents. However, further 
investigation is required to apply such a background 
learning process to the filtering agents. 

The belief set K = content(B) is actually used by the 
agents to infer the relevance of Web documents. On the 
other hand, maxi-adjustment is employed to revise the 
theory base exp(B) and to maintain its consistency af- 
ter applying changes. Though maxi-adjustment ensures 
that the revised theory base is consistent, it is possible 
that the belief set K becomes inconsistent (i.e. K h _L) 
after applying changes such as B*(a,z). The following 
is a classical example to explain such a problem. It is 
assumed that the set of explicit beliefs exp(B) as well 
as their entrenchment ranking is as follows: 



G 



p-r.p(TK) = {\/x[ppngnin(T.) — > hird(-r)~\ 



^p) ([Dubois, Lang, fc Parade 1993 ; Dubois, Lang, & 
Prade 1994| ). In possibilistic logic, the inconsistency 



\fx[penguin(x) — + ->fly(x)], 
\fx[bird(x) —> fly(x)}} 

Ti(\lx[penguin(x) —> bird(x)]) = 0.9, 
T5(yx[penguin(x) — > ~>fly(x)]) = 0.7, 
B(\/x[bird(x) -> A/(a:)]) = 0.4 

In the context of information retrieval and filtering, 
penguin{x) — > bird(x) can be interpreted as: if a user 
is interested in information objects about penguin, it 
is likely that the user is also interested in informa- 
tion objects about bird. The other formulae in exp(B) 
can be interpreted in similar way. Given the fact that 
the user is interested in tweety which is a penguin i.e. 
penguin(tweety) and H(penguin(tweety)) = 0.8, by 
applying maxi-adjustment B* (penguin(tweety) , 0.8), 
the revised entrenchment ranking newB will be: 

newH(\/x[penguin(x) — > 6ird(x)]) = 0.9, 
newli(penguin(tweety)) = 0.8, 
new~B(\fx[penguin(x) — > -'/^(x)]) = 0.7, 
newB(Va;[6ir<f(x) -> fly(x)}) = 0.4 

The degree of acceptance of implicit beliefs is com- 
puted based on definition |^: 

degree(newH, -> fly (tweety)) = 0.7, 
degree(newTi, fly(tweety)) = 0.4 

As can be seen, even though exp(newTi) Vf J_, it 
is clear that content (newH) h _L, where h is classical 
dcrivability relation. If the filtering agent employs the 
belief set K — content(newH) to deduce the relevance 
of Web documents, any documents will be considered 
as relevant. This problem must be addressed before the 
filter agents can be put to practical use. One possible 
solution is to make use of the degree of acceptance of 
beliefs to produce the largest cut of content(B) so that 
it does not entail _L. In other words, only the set of 
consistent beliefs A = {(3 G K : _L < /?}, where < is the 
strict part of epistemic entrenchment, will be used by 
the filtering agent to infer the relevance of Web docu- 
ments. Similar idea has been e xplored in developing the 
expectation inference relation ( pardenfors fc Makinson 
199|). For instance, a ^ 7 iff 7 e Cn({a} U {j3 G 
A : ->a < /?}) is developed based on A = {$ : _!_ < /?}. 
So, with reference to the above example, after applying 
B* (penguin(tweety), 0.8), the agent should only make 
use of the following set of beliefs for reasoning: 

A = {\/x[penguin(x) — > bird(x)], 
penguin(tweety) , 
bird(tweety), 

yx[penguin(x) — » ~<fly(x)}, 
-. fly (tweety)} 

Therefore, the agent can conclude that the user 
is interested in information objects about non-flying 
tweety. The above reasoning process can also be ex- 
plained based on nontrivial possibilistic deduction (| 



degree Incons(F) of a possibilistic knowledge base F 
is defined as the least certain formula involved in the 
strongest contradiction of F. Moreover, nontrivial pos- 
sibilistic deduction is defined as: F (a, m) iff F \= 
(a,m) and m > Incons(F). If the entrenchment rank 
of a formula a is taken as the certainty m of a possi- 
bilistic formula, Incons(K) = 0.4 with reference to the 
above example. By employing possibilistic resolution, 
K \= (fly(tweety),0A) and K \= (-> fly (tweety), 0.7) 
can be obtained, where |= is possibilistic entailment. 
Since the certainty degree m of ->fly(tweety) equals 0.7 
and is greater than Incons(K), K —>fly (tweety). 
Nevertheless, fly (tweety) can not be deduced from K 
based on (\^p) because m = 0.4 y£ Incons(K). How- 
ever, further investigation is required to apply possi- 
bilistic based inference to the matching components of 
adaptive information filtering agents. 

Conclusions 

The AGM belief revision paradigm offers a powerful and 
rigorous foundation to model the changes of an agent's 
beliefs. The maxi-adjustment strategy, which follows 
the AGM rationale of consistent and minimal belief 
changes, provides a robust and effective computational 
mechanism for the development of the filtering agents' 
learning components. As semantic relationships among 
information items can be reasoned about via the maxi- 
adjustment method, less human intervention may be 
required during the agents' reinforcement learning pro- 
cesses. This opens the door to better learning autonomy 
in adaptive information filtering agents. The technical 
feasibility of applying the maxi-adjustment method to 
adaptive information filtering agents has been exam- 
ined. However, quantitative evaluation of the effective- 
ness of these agents needs to be conducted to verify the 
advantages of applying such a framework to construct 
the learning mechanisms of these agents. 
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